Weakly Supervised U-Net with Limited Upsampling for Sound Event Detection

نویسندگان

چکیده

Sound event detection (SED) is the task of finding identities sound events, as well their onset and offset timings from audio recordings. When complete timing information not available in training data, but only are known, SED should be solved by weakly supervised learning. The conventional U-Net with global weighted rank pooling (GWRP) has shown a decent performance, extensive computation demanded. We propose novel limited upsampling (LUU-Net) threshold average (GTAP) to reduce model size, computational overhead. expansion along frequency axis decoder was minimized, so that output map sizes were reduced 40% at convolutional layers 12.5% fully connected without performance degradation. experimental results on mixed dataset DCASE 2018 Tasks 1 2 showed our GTAP about 23% faster achieved 0.644 tagging 0.531 tasks terms F1 scores, while GWRP 0.629 0.492, respectively. major contribution proposed LUU-Net reduction time being maintained or improved. other method, GTAP, further improved provides versatility for various mixing conditions adjusting single hyperparameter.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Weakly Supervised Action Detection

Detection of human action in videos has many applications such as video surveillance and content based video retrieval. Actions can be considered as spatio-temporal objects corresponding to spatio-temporal volumes in a video. The problem of action detection can thus be solved similarly to object detection in 2D images [3] where typically an object classifier is trained using positive and negati...

متن کامل

Towards Accurate Event Detection in Social Media: A Weakly Supervised Approach for Learning Implicit Event Indicators

Accurate event detection in social media is very challenging because user generated contents are extremely noisy and sparse in content. Event indicators are generally words or phrases that act as a trigger that help us understand the semantics of the context they occur in. We present a weakly supervised approach that relies on using a single strong event indicator phrase as a seed to acquire a ...

متن کامل

Weakly Supervised Object Detection with Posterior Regularization

Motivation: In weakly supervised object detection where only the presence or absence of an object category as a binary label is available for training, the common practice is to model the object location with latent variables and jointly learn them with the object appearance model [1, 5]. An ideal weakly supervised learning method for object detection is expected to guide the latent variables t...

متن کامل

Sentence Subjectivity Detection with Weakly-Supervised Learning

This paper presents a hierarchical Bayesian model based on latent Dirichlet allocation (LDA), called subjLDA, for sentence-level subjectivity detection, which automatically identifies whether a given sentence expresses opinion or states facts. In contrast to most of the existing methods relying on either labelled corpora for classifier training or linguistic pattern extraction for subjectivity ...

متن کامل

Collaborative Learning for Weakly Supervised Object Detection

Weakly supervised object detection has recently received much attention, since it only requires imagelevel labels instead of the bounding-box labels consumed in strongly supervised learning. Nevertheless, the save in labeling expense is usually at the cost of model accuracy. In this paper, we propose a simple but effective weakly supervised collaborative learning framework to resolve this probl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2023

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app13116822